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Abstract 

A key fact in the theory of Boolean functions / : {0, 1}™ — > {0, 1} is that they often undergo 
sharp thresholds. For example: if the function / : {0, l}' 1 — > {0, 1} is monotone and symmetric 
under a transitive action with E p [/] = e and E q [/] = 1 — e then q — p — > as n . — >• oo. Here 
E p denotes the product probability measure on {0, l} n where each coordinate takes the value 1 
independently with probability p. 

The fact that symmetric functions undergo sharp thresholds is important in the study of 
random graphs and constraint satisfaction problems as well as in social choice. 

In this paper we prove sharp thresholds for monotone functions taking values in an arbitrary 
finite sets. We also provide examples of applications of the results to social choice and to random 
graph problems. 

Among the applications is an analog for Condorcet's jury theorem and an indeterminacy 
result for a large class of social choice functions. 

1 Introduction 

1.1 Sharp thresholds 

A key fact in the theory of Boolean functions is that monotone symmetric functions undergo sharp 
thresholds. This fact has fundamental significance in the study of constraint satisfaction problems, 
random graph processes and percolation and in social choice. 

The results of [4] show that for every < e < 1/2 there exists a C > such that for all n and 
all / : {0, l} n —7- {0, 1} which is monotone and symmetric (see definitions below) if E p [/] = e and 
E 9 [/] = 1 — e then < q — p < C(logn) . This result implies in particular that "Every monotone 
graph property has a sharp threshold" as [4] is titled. It also implies that symmetric voting systems 
aggregate information well, see for example [5] and the examples provided in the current paper. 
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1.2 Notation and Main Results 



Let A be a finite set. Let X = A n . For a € S(n), a permutation on n elements and x € A n we denote 
by y = x a the vector satisfying y, = x CT (j) for all i. For cr € 5*(t4) we write y = cr(x) for the vector 
satisfying y^ = a(xi) for all i. For a £ A and x, y € X we write x < a y if {i : Xj = a} C {i : y% = a} 
and for all i such that y^^ ait holds that Xj = y^. In other words if x < a y then for all i if Xj ^ y,; 
then yj = a. It is easy to see that < a defines a partial order on X. 

We say that f : X = A n — > A is monotone if for all a € A and x, y 6 X such that x < a y it 
holds that f(x) = a implies that /(y) = a. We say that / is symmetric if there exists a transitive 
group S C <S(n) such that f(x a ) = /(x) for all x € X and <r € X. We say that / is fair if for all 
a € 5(A) and all x G X it holds that f(a(x)) = a(f(x)). 

Let A [A] denote the simplex of probability measures on A and let 7 denote the standard prob- 
ability measure on A [A]. For [i € AL4] denote by the measure ii® n on X. We denote by 
the expected value according to the measure P^. For any measure \i € A [A], we write (i* for the 
minimal probability ji assigns to any of the atoms in A. 

In our main result we show that: 

Theorem 1.1. There exists an absolute constant C = C{\A\) such that if f is symmetric and 
monotone then for any a € A and e > it holds that 

l[^.e< P,Af = a] < 1 - e] < C(log(l - e) - log(e))^^. 

logn 

The result above establishes sharp threshold for symmetric functions as it shows that for almost 
all probability measures / takes one specific value with probability at least 1 — e. 

Remark 1.2. It is interesting to compare the results established here to those of [4]. For \A\ = 2 
our results give a threshold interval of length 0(log log nj log n) compared to the results of [4] which 
give threshold interval of length 0(1/ log n). In the binary case the later result is tight. It is natural 
to conjecture that the threshold is always of measure 0(1/ logn). 

1.2.1 Other applications 

Various applications of the threshold result to problems involving distributions of edge colored 
graphs are given in subsection 3.2. 



1.3 Social choice background, and applications 

We will describe two main applications of our main result to social choice theory. The first ap- 
plication gives an extension of Condorcet's Jury theorem for monotone choice functions for more 
than two candidates and large classes of voting rules. The second application is to indeterminacy 
results for generalized social choice functions. 
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1.3.1 Aggregation of Information 



The law of large numbers implies that in an election between two candidates denoted and 1, if 
every voter votes for with probability p > 1/2 and for 1 with probability 1—p and if these votes 
are independent, then as the number of voters tends to infinity the probability that will be elected 
tends to one. This fact is referred to as Condorcet's Jury Theorem. 

This theorem can be interpreted as saying that even if agents receive very poor but independent 
signals indicating which decision is correct, majority voting will nevertheless result in the correct 
decision being taken with a high probability if there are enough agents (and each agent votes 
according to the signal he receives). This phenomenon is referred to as asymptotically complete 
aggregation of information and it plays an important role in theoretical economics. 

More recent results studied aggregation of information for general symmetric fair functions 
/ : {0, l} n — >■ {0, 1}. Recall that such a function is fair if f(l — x±, . . . , 1 — x n ) = 1 — f(x±, . . . , x n ). 
In this setup the results of [4] imply that for every p > 1/2 and every symmetric fair function on 
n voters it holds E p [/] > 1 — o(l). 

Here we derive a similar result in the case of an election between [q] candidates. Note that the 
conditions of monotonicity and fairness are both natural in this setup. 

• Monotonicity implies that if in a certain vector x £ [q] n of voters a certain candidate i is 
elected and if y is identical to x except that some of the voters changed their mind as to vote 
i, then the outcome of the vote for y should also be i. 

• Fairness means that all the candidates are treated equally. 

Theorem 1.3. For every q there exists a constant C = C{q) for which the following holds for every 
e < 1/3. Let p £ A(q) and i £ [q] satisfy that 

p(i) > max Mj) + C(log(l - e) - log(l/<z))^^. 

logn 

Then for every fair monotone function f : [q] n — >• [q] it holds that 

p[f = i]>l- e. 

In words - the proposition claims that for any measure on the votes that has a log log n / log n 
bias towards one of the candidates i and any fair monotone voting function it holds that i will be 
elected with high probability. The proof is given in subsection 3.1. 

1.3.2 Indeterminacy 

Arrow's impossibility theorem asserts that under certain natural conditions, if there are at least 
three alternatives then every non-dictatorial social choice gives rise to a non-rational choice function, 
i.e., there exist profiles such that the social choice is not rational. Arrow's theorem can be seen 
in the context of Condorcet's "paradox" which demonstrates that the majority rule may result 
in the society preferring A over B , B over C and C over A. Arrow's theorem shows that such 
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"paradoxes" cannot be avoided with any non-dictatorial voting method. It is the general form 
of Arrow's theorem, which can be applied to general schemes for aggregating individual rational 
choices, that made it so important in economic theory. 

McGarvey [7] appears to have been the first to show that for every asymmetric relation R on a 
finite set of candidates there is a strict-preferences (linear orders, no ties) voter profile that has the 
relation R as its strict simple majority relation. This implies that we cannot deduce the society's 
choice between two candidates even if we know the society's choice between every other pair of 
candidates. We refer to this phenomena as "complete indeterminacy". 

Saari [9] proved that the plurality method gives rise to every choice function for sufficiently 
large societies. (In fact, he proved more: the ranking on any subset of the alternatives can be 
prescribed). This implies that knowing the outcome of the plurality choice for several examples, 
where each example consists of a set S of alternatives and the chosen element c(S) for S, cannot 
teach us anything about the outcome for a set of alternatives which is not among the examples we 
have already seen. 

In [5] McGarvey's theorem is extended to sequences of neutral social welfare functions in which 
the maximum Shapley-Shubik power index tends to 0. The proof relied on threshold properties of 
Boolean functions. In particular, McGarvey's theorem extends to neutral social welfare functions 
which are invariant under a transitive group of permutations of the individuals. We describe a 
similar extension of Saari's theorem. 

2 Proof of the main theorem 

The proof follows the same simple idea used in [4]. That is, we use information on the influences 
in order to deduce that E„[/] varies quickly as a function of jjl. We thus derive a generalization of 
Russo's formula [6, 8] which expresses derivatives in terms of influences. We then use the fact that 
functions taking only a bounded number of values must have large influence sums. 

Depending on the type of influence sum bounds, one obtain different results. Using the results 
of [1] it is possible to obtain a bound on the threshold interval length of order 0(log 1//3 n).Here 
we derive the better bound of order O (log log nj log n) using a generalization of the results of 
Talagrand [10]. This generalization was proven in a course taught by the second author on Fall 
2005. A draft of the proof was written in scribe notes of the course by Asaf Nachmias at 
http : / /www. stat.berkeley.edu/ ~ mossel/teach/206af05/scribes/oct25.pdf. For completeness 
we have corrected and completed the proof. 

It seems like in order to obtain a threshold bound of 0(1/ log n) it would be needed to derive a 
tighter influence sum bound tailored to the setup here. 

From now on, without loss of generality assume A = [q] = {0, 1 . . . , q — 1}. We will consider 
functions / : [q] n — > {0, 1} and say that such a function is 0-monotone if x <o y implies that 
f{x) < f(y). 

Clearly in order to prove Theorem 1.1, it suffices to prove that there exists C > such that if 
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/ is symmetric and O-monotone then for all e > it holds that 

7[M : e < P M [/ = 0] < 1 - e] < C(log(l - e) - log(e))^|^. (1) 

2.1 Influences 

We will use the L2 influence 
Let / : X -> K. 

Let 11(f), the i'th influence of / according to /x 6 A [A] by 

^(7) = E M [Var[/|xi,. . . , . . .,%]] 

In Corollary 5.7 we will derive the following lower bound on influence sum which is a consequence 
of a generalization of a result of Talagrand [10] . We restate the corollary here: 

Corollary 2.1. There exists some universal constant C such that for any probability space 
and any function f : Q n — > {0, 1} which is symmetric it holds that it holds that 

g w,£ i^(iM log " Var(/) - 

2.2 Russo Type Formula 

We denote by T(A) the tangent space to S(A). The space is easily identified with the space of all 
vectors in t € M. A satisfying ^ ij = 0. The natural derivative on T(A) satisfies dPu[f]/dt € T(A). 

Lemma 2.2. Let f ■ [q] — > {0, 1} be a 0-monotone function and [i a measure on [q] and suppose 
that f is 0-monotone. Write fi = (1 — //(0))// + /u(0)<5o where /z'(0) = and // is a probability 
measure. Let t = So — // G T(A). Then 

d^+ht[f] 



dh 



1(/ is noi constant) fi [1 — f] > Var^/f/]. 



Proof. By linearity the derivative equals /(0) — //[/]• The last expression is when / is constant. 
If / is not constant then by monotonicity it holds that f(0) = 1 and therefore the derivative is 
1 — //[/] as needed. Finally, the inequality follows from the fact that if / is constant then the 
variance is and otherwise fi'[l — /] > — /] = Vaiyf/]. □ 

We now prove a generalization of Russo's formula. 

Lemma 2.3. Let f : [q] n — > {0,1} be a 0-monotone function and [i € A\q\. Write jjL = (1 — 
fj,(0))fj,' + /j,(0)5q where n'(0) = and fj! is a probability measure. Let t = 6q — // £ T(A). Then: 

y , _ = E E ^ MfW is not constant ) e m'[! - > EW)> 

on \h — . „ . 

8=1 1=1 

where Fi denotes conditioning on x±, . . . , a^+i, . . . ,x n . 
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Proof. Use the chain rule: 

dh \h=o dh \h=o ^ dhi \(hi,...,h n )=o 

and Lemma 2.2. □ 

2.3 Proof of the Main Result 

We now prove the main result, i.e., (1). In the proof below the constants c will denote different 
constant at different lines. All of them depend on q only. 

Proof. Let T denote the set of measures fi in A[q] satisfying the following: 

• /x[0] = 0. 

• iA}\ — v f° r an * 7^ 0- 

Write /i* = t5o + (1 — t)fi and note that 

j[^:0<t<l,fi^T] <C(q) V . (2) 
For each measure \i € F we look at the interval J^' 1-6 , where 

jf- e = {teI^.e<P^[f]<l-e}. 

Our goal is to show that the interval J^' 1 e is short by bounding the derivative of 

G{t)=Af\ 

in the interval. In fact we will bound the length of each of the sub-intervals J^' 1 ^ 2 and J^ 2 ' 1 e . 
Since / is 0, 1 valued, it follows that in the interval J^' 1 ^ 2 it holds that Var„t[/] > | By 
Lemma 2.3 and Proposition 2.1 we conclude that: 



i=i 

This implies that: 



(lnG)'(t)> Cl ° gn 



log(l/r/) ' 

In particular if G(p) = e and G(q) = 1/2 then: 

ln(l/2) - ln(e) > bg(i/7?) . 

S °' , , < (ln(l/2)-ln(e))log(lA ? ) 

\p-q\< ; ■ 

c log n 
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Repeating the same argument for the interval (1/2, 1 — e) we obtain that 



Te , e . ^ (ln(l-e)-ln( e ))log(l/r ? ) 



clog n 

Which together (2) with implies that 

\ logn / 

Taking r/ = (ln(l — e) — ln(e))/logn we obtain the bound: 

C W + ( ln ( X ~ e ) ~ ln ( £ )) TT~ ^ 2C W) ((M 1 - e ) - ln ( e )) 



logn logn J logn 

as needed. 

□ 

3 Applications 

3.1 A general form of Condorcet's Jury theorem 

We begin with a proof of Theorem 1.3. Since the proof is similar to the previous proof, we only 
sketch the main steps. 

Proof. We only sketch the proof since it is similar to the proof of Theorem 1.1. Assume first that 
fi(j) > for all j. Let fi = t*5i + (1 — t*)fi' where // is a probability measure with fi'(i) = 0. 
As before write /i* = t5i + (1 — t)fjf for every t. Let s < t* be chosen so that fJ. a (i) = max^yj ^ s (j)- 
From symmetry and monotonicity it follows that 

= *]>!/«■ 

Moreover, the argument in Theorem 1.1 shows that for t in the interval (s,t*) it holds that 

dfi t+h [f = i] t t loglogn 

777 , > Cfi [f = i 1 -n [f = i — , 

on \h=o logn 

which implies that 

\t* -s\< C(log(l - e) - log(l/g))^i^, 

logn 

thus proving the statement of the theorem. 

It remains to remove the assumption that //(j) > for all j. The general case where some 
of the probabilities fi(i) may satisfy < requires one additional step at the cost of taking 
the constant C to be C = C + (q — 1). Instead of the measure fj, we first consider the measure /t 
where = + 1/logn for j ^ i and = fj,(i) — (q — l)/logn. 



7 



Note that if fj, satisfies the conditions of the theorem with the constant C then £i satisfies it 
with the constant C. Moreover > 1/logn for all j. Therefore by the first part of the proof 

A[/ = i] > 1 - e- 
On the other hand, by monotonicity we have 

iAS = i \> £[/ = »1i 

which implies the desired result. □ 



3.2 Graph properties 

Consider a process on the edges of the complete graph K n where each edge e = {a, b} is labeled 
by i with probability fj,(i) for 1 < i < q independently for different edges. This process defines 
sets Ei,... ,E q where Ei is the set of edges labeled by i. In other words, it defines q graphs 
{[n],E 1 ),...,([n],E q ). 

Definition 3.1. A function A from the set of partitions of the edges of K n to q parts into [q] is 
called a graph property if: For all partitions E\, . . . , E q and all permutations a € S(n) it holds that 

A(E l ,...,E q ) = A(a(E 1 ),...,a(E g )), 

where cr(Ei) = {{a(u), a{y )} : {u,v} £ Ei}. 

The function A is called a monotone graph property if 

• For every pair of partitions (E\, . . . , E q ) and (Fi, . . . , F q ) 

• and all 1 < i < q it holds that 

If: 

• Ei ^ Fi and 

• Fj C £Jj for j/i and 

• A(E 1 ,...,E q ) = i, 

Then = i. 

Here are a few examples: 



• Given a graph labeled by [q], let -A(i?i, . . . , E q ) = i, where i is the minimal index for which \Ei\ 
is maximal. In other words, i is the most popular label, where ties are decided by preferring 
the smaller index. 

• Given a graph labeled by [q] let A(E±, . . . , E q ) = i where Ei has the largest clique. Again ties 
are decided by preferring the smaller index. 
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• Given a graph labeled by [q] let A(E\, . . . , E q ) = i where Ei has the smallest independent set 
of vertices. Again ties are decided by preferring the smaller index. 



Theorem 1.1 implies the following: 

Corollary 3.2. There exists a constant C = C(q) such that for every monotone graph property it 
holds that 



4 Indeterminacy for voting methods 
4.1 The setting 

Let P+(X) denote the family of non-empty subsets of X. 

Definition 4.1. Given a set X of m alternatives, a choice function c is a mapping which assigns to 
each nonempty subset S of X an element c(S) £ S. A choice function is thus a map c : P+(X) — > X 
with the additional property that c(S) € S for all S S P + (X). 

A choice function is called rational if there is a linear ordering on the alternatives such that c(S) 
is the maximal element of S according to that ordering. 

A social choice function is a map 

of the form c = F(c\, C2, . . . , c n ) where c is a choice function on X which depends on the profile 
of individual rational choice functions ci, C2, . . . c n for the individuals. 

Note that there are two different meanings for the term "social choice functions" . Sometimes a 
social choice function is referred to as a map which associates to the profile of rational individual 
choices (or preferences) a single "winner" for the society. A social choice function in this sense 
easily defines a social choice function in our sense by restricting to a subset of alternatives. We 
can regard social choice functions as election rules which given a set S of candidates and (strict) 
preference relations of the individuals on the candidates, provides a rule for choosing the winner. 
We regard X as the set of all possible candidates, and S as the set of available candidates, and we 
are interested to understand the society's choice as a function of S. 

The axiom of Independence of Irrelevant Alternatives (Arrow's IIA) asserts that c(A) may 
depend only on the preference relations restricted to the set A. 

We require the stronger property Independence of Rejected Alternatives (IRA), also referred to 
as Nash's IIA. 

(IRA) (Independence of Rejected Alternative (IRA)) c(S) is a function of (ci(S),C2(S), . . . c n (S)) 
Therefore we can write c(S) = Fs(ci(S), c%(S), . . . c m (S)). 
We will require a few more conditions: 



7 [/i : e < P^A = i] < 1 - e] < C(log(l - e) - log(e)) 



log log n 



log n 



(P) (Pareto) c(5) € {a(S), c 2 (5), . . . 



Cn(S)}. 
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4.2 Further assumptions 

We will make the following additional assumptions: 

(U) (Unrestricted domains) The social choice function is defined for arbitrary rational profiles of 
the individuals. 

(Nl') (Neutrality) The social choice is invariant under permutations of the alternatives. 

(A2') (Weak anonymity) The social choice is invariant under a transitive group of permutations of 
the individuals. 

(M') (Monotonicity) The function c(S) is monotone in the following sense: If c(S) = s and Ci(S) = t 
for t ^ s then changing the choice of the i-th individual from t to s will not change c(S). 

4.3 The result 

Theorem 4.1. Let cq be an arbitrary choice function on P + (X) where X is a set of m alternatives 
and let 5 < 1. 

Then there is a probability distribution v = v{cq) on the space of orderings of the alternatives 
and a number N = N(5, m) such that the following holds for every social choice function F which 
satisfies conditions (IRA) (P) (IP) (NP), (A2') and (AP): 

If the number of individuals n is larger than N and if every individual makes the choice randomly 
and independently according to v, then with probability at least 5 for all S £ P+(X) it holds that 
c = F(ci, c n ) satisfies c{S) = cq{S) with probability of at least 5 (here Ci(S) is the highest ranked 
alternative in the order of voter i). 

Corollary 4.2. There exists N = N{m) such that when the number n of individuals is larger than 
N{m), every choice function c on P + {X), where X is a set of m alternatives, can be written as 
c = F(ci, . . . , o„) for every social choice function F satisfies conditions (IRA), (P), (IP) (NP), 
(A2') and (AP) applied to c i, • • • , c n that are determined by rankings. 

4.4 The proof of Theorem 4.1 

Proof. Let cq be an arbitrary choice function and consider a profile with no individuals such that 
the plurality leads to cq. Such a profile exists by Saari's theorem. 

For an ordering tt of the alternatives let w'(n) be the number of appearances of the order tt and 
let w(tt) = w'(Tt)/riQ. 

Consider a random profile (ci,...,c„)onn individuals where for each individual i the probability 
that i-th preference relation c; is described by tt is w(ir) (independently for the individuals). 

We will show that for all 6' < 1 if n = n{5') is sufficiently large, then for S C X it holds with 
probability at least 5' that c(S) = Fs(c\(S), . . . ,c n (S)) satisfies c(S) = co(S) with probability at 
least 5' . This implies the required result by taking 1 — 6' < 2~ m (l — 5). 
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To establish the claim above note that c±(S), . . . ,c n (S) are i.i.d. and that for all a ^ cq(S) it 
holds that P[ci{S) = c (S)] > P[ci{S) = a] + n^ 1 . Further more the function F$ as a functions of 
c\(S), . . . ,c n (S) satisfies neutrality and weak anonymity. Thus from Theorem 1.3 it follows that 
for n large enough the probability that Fs(ci(S), . . . , c n (S)) = cq(S) is at least 1 — 5' as needed. 

□ 

Note that the proof actually implies the following in the spirit of a theorem by Dasgupta and 
Maskin [2]. 

Corollary 4.3. There exists N = N(m) > with the following property: Let U be a set of linear 
orders on m alternatives. If for the plurality rule there is a profile restricted to U which leads to 
a choice function c for the society, then when n > N(m) this is the case for every social choice 
function which satisfies conditions (IRA), (P), (Nl'J, (A2') and (M'j. 

4.5 Taking rejected alternatives into account 

We now describe a crucial example suggested by Bezalel Peleg. First, we note how we can base 
on choices on pairs a choice correspondence: Given an asymmetric binary relation R on the set of 
alternatives let c(S) be the set of elements y of S such that the number of z € S such that yRz is 
maximal. In other words, when we consider the directed graph described by the relation we choose 
the vertex of maximal out-degree. 

Let TZ denote the class of rational choice functions and B denote the class of choice correspon- 
dences obtained from binary relations R as just described. Consider also the class B' of choice 
functions obtained from B by choosing a single element in c(S) according to some fixed order 
relation on the alternatives. The number of choice functions in B' and the number of choice corre- 
spondences in B is exponential in (J) • 

Now describe a social choice function as follows: aRb if a majority of the society prefers a to b. 

In this case the social choice of S does not depend solely on the individual choices for S but 
also on the preferences among pairs of elements in S. 

When the individual choices are rational then the social choice still belongs to the class B (or 
£>'). In this case the choice from S is simply those elements of S which are Condorcet winners 
against the maximal number of other elements in S. In this example the social choice for a set S 
is typically large but this apparently be corrected by various methods of "tie breaking" . 

In these examples the size of the resulting classes of choice functions is exponential in a quadratic 
function of m 2 . It is much smaller than the number of all choice functions which is double expo- 
nential in m. 

The Borda rule can be analyzed by a similar consideration. For this rule c(A) is determined as 
follows: For each alternative a € A let r^(a) be the number of individuals who ranked a in the ith 
place (among the elements of A). Let r(o) = ■ rj(a). The chosen element by the society c(A) 
is the element of a with the minimal weight. 

Another way to describe the Borda rule is as follows: First construct a directed graph (with 
multiple edges) with A as the set of vertices by introducing an edge from a to b for every individual 
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that prefers a to b. Next, define (as before) c(A) as the vertex with maximal outdegree. 

It is easy to prove that the number of choice functions that arise in this way is at most exponential 
in m 3 . (The choice function can be recovered from the sign patters of (less than) 2 m ■ m 2 linear 
expressions in m 2 real variables. 

To summarize, the size of classes of choice functions that arise from a social choice function such 
that c(S) may depend on the individual preferences of the elements of S is at least exponential in 
m 2 and this bound is sharp. 

5 A generalization of Talagrand's result 

In this section we prove the bounds on L 2 influence sums for arbitrary probability spaces. For this 
we first recall the notion of Efron-Stein decomposition [3] then generalize Talagrand's result [10] 
and finally derive corollaries for {0, l}-valued and symmetric functions. 

5.1 Efron-Stein decomposition 

Consider finite probability spaces fii,...,fi n , with measures /xi ,...,/%. Let cti be size of the 
smallest atom of (fij,/^), and set a = minjOtj. Let / £ L 2 ([li^i) be a real valued function. Write 
/ = Y2sc[n] fs f° r ^ ne Efron-Stein decomposition of / which we now recall. 

Definition 5.1. Let (Cl±, /xi), . . . , (O n , fj, n ) be discrete probability spaces = n£=i(^ij A 4 *)- 

The Efron-Stein decomposition of / : — > R is given by 

fix) = Y, fsfrs), (3) 

SC[n] 

where the functions fs satisfy: 

• fs depends only on xs- 

• For all S % S' and all xs> it holds that: 

E[fs\X s > = xs>} = 0. 

It is well known that the Efron-Stein decomposition exists and that it is unique [3]. We quickly 
recall the proof of existence. The function fg is given by: 

f S (x) = J2 (-i) lsxs ' l nf(x)\x s , = x s ,\ 

s'cs 

which implies 

J2 fsb) = y, nf\x S ' = x\ J2 = E im n] = x w ] = f(x). 

S S' S:S'CS 
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Moreover, for S % S' we have E[fs\Xs' = xs'] = E[fs\Xs'ns — x S'ns] and for S' that is a strict 
subset of 5 we have: 

E[f s \X s , = x S '} = Yl (-l) |5V5 "'E[/(X)|X5»nS' = x s »ns'} 
s"cs 

= ]T B[f(X)\X s „ = x s »] £ (-i) 15 ^ 1 = o. 
S"cS' S"cscs"u(s\s') 

5.2 Generalization of a Result of Talagrand 

We now prove: 

Theorem 5.2 (Generalization of Talagrand, 1994). There exists some universal constant C such 
that for any probability spaces (O, (J,) and any function f € L? (f2 n , fi n ) it holds that 

Var(/) < Clog(l//i*) £ ? l|AJIIi ^ . 

^log(||A i /|| 2 /||A i /|| 1 J 

w/iere A*/ = J2s-.iesfs- 

The proof is almost identical to Talagrand's proof using Efron-Stein decomposition instead of 
Fourier expansion and known bounds on the hyper-contractive constants of finite probability spaces. 
In particular we'll use the following result of Wolff [11]: 

Theorem 5.3. For g : Q n -> R, let: 

s 

Then for all g it holds that: 

\\T a gh < hh/2, 

for all 

/ (1 - q)2~4/3 _ Q 2-4/3 1/2 

° ~ V (1 - a)a!- 4 / 3 - a(l - a) 1 " 4 / 3 J ' 

Remark 5.4. In particular one may take a = a 2 /6. 
Proof Let 

= x 2 /3 j 5 (x) = -(1 -x)/x 1/3 . 
By Lagrange's theorem we may take: 



a 2 



f(6) 



for some £1,^2 € («> 1 — a). Clearly 

/'(a;) = (2/3)*- 1 / 3 , = (2/3)x~ 1 / 3 + (1/3)*" 4 / 3 , 
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are decreasing, and therefore 



2 f'(l -a) 2(1 -ay 1 / 3 2 a 2 

° ~ g'(a) ~ 2a- 1 / 3 + a~ 4 / 3 ~ 3a" 4 / 3 - ~Q~ 



□ 



We now prove Theorem 2.1. 
Proof. For a real function 5 from our space, denote 



So 



Issll! 



1*1 



Var[/] = ^||/ 5 ||i = ^M 2 (A l /). 
5^0 i=l 

Note that the statement of the theorem follows if we prove that for any function g with Eg = 0, 

M\g) < 10og(l/a) r- . (4) 



The statement of the theorem follows by apply (4) to Aj/ and summing the inequalities. To prove 
(4) we use hypercontractivity. The following proposition is proved in the end of this note. 

Applying Theorem 5.3 gives that for any integer k > 0, 

2k IU II 2 S „2\S\ ||„ ||2 \\rr i|2 IUI|2 

0<|S|=fc s 

hence 

E ll2 / 6 \ fc n o 
ll<7s|| 2 < ("J J lblls/2 

|S|=fc 



Fix an integer m > 0, and sum the previous inequality for all k < m to get 



k f \ m 

TV 



12 I 6 I 2(4 

\\9S\\2 ^ n u2 ^ V a V t| ||2 

|5|<m 1 1 k<m 



where the last inequality comes from the fact that the ratio between two consecutive summands in 
the sum is greater than 2. We now have 



a 1 m 
O 



\\n 112 iu ||2 2 Mr 1U112 

m 2 ( 5 ) = E %r+ E %r< ^^115111/2 + ^ 

\S\<m 1 1 \S\>m 1 1 



< 2 r/ 6 \m 
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(\ m 
c? ) II5II3/2 — ll^lli' hence 



6 \m+l 21og(||5|| 2 /||5 
~ 2 ) IMI 3/2 >IM| 2 => rn+l> 



Plugging this back into (5) gives 

M 2 (g) < C 

log (IMI2/IMI3/2 

An application of Cauchy-Schwartz gives 

J l3/ 2 ^ Il«/Ili||y|l2 ' 



I5II3/2 < NlilM' 2 



hence 



l5ll3/2\ 3 Hslli 



V IISII2 > \\9\\l 

which concludes the proof of (4) and so we are done. 

5.3 The formula for {0, 1} valued functions 

For {0, 1} valued functions, Theorem 2.1 has a very simple formulation. 
Lemma 5.5. Let f : n£=i ft< ->■ {0, 1}. Then 

||A i /||i = 2/ i (/). 
Proof. Let /:[?]—>■ {0, 1} be a function with E[/] = p. Note that 

E[|/ - p\] = P (l -p) + (l- p)p = 2Var[/]. 

Since 

Aj/ = / — E[/|Xi, . . . , . . . , X n ], 

we see that 

S[|A i (/)||X 1 ,... ) X i _ 1 ,X i+1) ...,X ri ] = 2Var[Ai|X 1 ,...,X i _ 1 ,X i+1 ,X ri ], 
and therefore taking expected value of X\, . . . , Xi—i,Xi+i,X n we obtain that 

||Ai/||i = 2hU). 



We now obtain the following corollaries 
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Corollary 5.6. There exists some universal constant C such that for any probability spaces (£l,n), 
and any function f : Q n — > {0, 1} it holds that 

In particular if Ii{f) < S for all i then: 

|>(/) > 2C log(l/^) {l ° g{1/5) ~ lo g(V4))Var M (/). 

Corollary 5.7. There exists some universal constant C such that for any probability space (0,/i) 
and any function f : £l n — > {0, 1} which is symmetric it holds that it holds that 

f>(/)> °. , lognVar M (/). 
^ log(l/a) 



=i 



Proof. If Ij > the claim follows immediately. Otherwise we apply the previous corollary and 
note that log(l/<5) = Sl(logn). □ 

Remark 5.8. It is a fundamental question both for the Boolean case and for the case of larger 
alphabet that we consider here to find conditions (and even appropriate definitions) for sharp 
thresholds for monotone functions. One may want to obtain similar result on the basis of the fact 
that the function / has low influences. However, this condition on its own does not suffice. Let 
m : {—1, l} 2n — > {—1, 1} be the majority function defined as follows: 

2n 

m n {x) = sign(y^Xj). 

i=l 

When the sum is the function is defined arbitrarily in such a way that it is —1 on half of the 
balanced inputs and 1 on the other half. We write vectors (x, y) £ {—1, l} 2n where x and y are 
two vectors of length n. Let f n = m(x, —y). Then clearly all of / influences are of order at most 
n -1 / 2 . On the other hand, it is easy to see that lim„_ 5 . 00 E p ( n )[/] = for all functions p(n) satisfying 
lim n _ 5 . 00 np{n) = and linin^oo n(l — p(n)) = 0. 
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